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Abstract 

In this note we provide a concise report on the complexity of the causal order¬ 
ing problem, originally introduced by Simon to reason about causal depen¬ 
dencies implicit in systems of mathematical equations. We show that Simon’s 
classical algorithm to infer causal ordering is NP-Hard—an intractability pre¬ 
viously guessed but never proven. We present then a detailed account based 
on Nayak’s suggested algorithmic solution (the best available), which is domi¬ 
nated by computing transitive closure—bounded in time by 0 (| V| ■ |iS|), where 
iS(£^, V) is the input system structure composed of a set £ of equations over 
a set V of variables with number of variable appearances (density) |iS|. We 
also comment on the potential of causal ordering for emerging applications 
in large-scale hypothesis management and analytics. 

Keywords: Causal ordering. Causal reasoning. Structural equations. 
Hypothesis management. 


1. Introduction 

The causal ordering problem has long been introduced by Simon as a tech¬ 
nique to infer the causal dependencies implicit in a deterministic mathemati¬ 
cal model [l|. For instance, let and f2{xi, X2) be two equations defined 

over variables xi,X2- Then the causal ordering problem is to infer all existing 
causal dependencies, in this case the only one is {xi,X2), read ^X2 causally 
depends on xid It is obtained by hrst matching each equation to a variable 
that appears in it, e.g., /2 e-)- X2- Intuitively, this means that /2 is to be 
assigned to compute the value of X2 —using the value of Xi, which establishes 
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a direct causal dependency between these two variables. Indirect dependen¬ 
cies may then arise and can be computed, which is specially useful when the 
system of equations is very large. 

Causal ordering inference can then support users with uncertainty man¬ 
agement, say, towards the discovery of what is wrong with a model for en¬ 
abling efficient and effective modeling intervention. If multiple values of Xi 
are admissible for a modeler, then as a user of the causal ordering machin¬ 
ery she has support to track their influence on the values of X 2 - One major 
application for that is probabilistic database design j^. 

Back in the SOth’s, Simon was motivated by studies in econometrics and 
not very concerned with the algorithmic aspects of the Causal Ordering Prob¬ 
lem (COP). Yet his vision on COP and its relevance turned out to be influen¬ 
tial in the artihcial intelligence literature. In a more recent study. Dash and 
Druzdzel revisit and motivate it in light of modern applications [^. They 
show that Simon’s original algorithm, henceforth the Causal Ordering Algo¬ 
rithm (COA), is correct in the sense that any valid causal ordering that can 
be extracted from a self-contained (well-posed) system of equations must be 
compatible with the one that is output by COA |3|. Their aim has also been 
(sic.) to validate decades of research that has shown the causal ordering to 
provide a powerful tool for operating on models. In addition to the result 
on the correctness of COA, their note also provides a convenient survey of 
related work that connects to Simon’s early vision on causal reasoniM. 

However, Simon’s formulation of COP into COA—originally in [l|, and 
reproduced in [^, turns out to be intractable. There is a need to distinguish 
Simon’s COA from COP itself. The former still seems to be the main entry 
point to the latter in the specialized literature. In fact, there is a lack of a 
review on the computational properties of COA—and as we show in this note, 
it tries to address an NP-Hard problem as one of its steps. The interested 
reader who needs an efficient algorithmic approach to address COP in a real, 
large-scale application can only scarcely hnd some comments spread through 
Nayak j3, p. 287-91], and then Iwasaki and Simon j^, p. 149] and Pearl 
[^, p. 226] both pointing to the former. Regarding Simon’s COA itself, the 
classical approach to COP, it is only Nayak who si^gests in words that (sic.) 
‘[it] is a worst-case exponential time algorithm’ |7|, p. 37]. We discuss this 
ambiguity that exists in the most up-to-date literature shortly in 1 11.21 

COP is signihcant also in view of emerging applications in large-scale 
hypothesis management and analytics [^. The modeling of physical and 
socio-economical systems as a set of mathematical equations is a traditional 
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approach in science and engineering and a very large bulk of models exist 
which are ever more available in machine-readable format. Simon’s early 
vision on the automatic extraction of the “causal mechanisms” implicit in 
(large-scale) models for the sake of informed intervention hnds nowadays 
new applications in the context of open simulation laboratories Jsl, lar ge-s cale 
model management and online, shared model repositories [lO, QQ. 

In this paper we review the causal ordering problem (§2]). Our core contri¬ 
butions are to originally show that COA aims at addressing an NP-Hard 
problem, conhrming Nayak’s earlier intuition; and then (Q to organize into 
a concise yet complete note his hints to solve COP in polynomial time. 


1.1. Informal Preliminaries 

Given a system of mathematical equations involving a set of variables, the 
causal ordering problem consists in detecting the hidden asymmetry between 
variables. As an intermediate step towards it, one needs to establish a one- 
to-one mapping between equations and variables [l|. 

For instance, Einstein’s famous equation E = me? states the equivalence 
of mass and energy, summarizing (in its scalar version) a theory that can be 
imposed two different asymmetries for different applications. Say, given a 
hxed amount of mass m = mo (and recalling that c is a constant), hnd the 
particle’s relativistic rest energy E] or rather, given the particle’s rest energy, 
hnd its mass or potential for nuclear hssion. That is, the causal ordering 
depends on what variables are set as input and which ones are “inhuenced” 
by them. Suppose there is uncertainty, say, a user considers two values to set 
the mass, m = mo or m = m^. Then this uncertainty will how through the 
causal ordering and ahect all variables that are dependent on it (energy E). 

For really large systems, having structures say in the order of one million 
equations [1^, the causal ordering problem is critical to provide more specihe 
accountability on the accuracy of the system—viz., what specihe variables 
and subsystems account for possibly inaccurate outcomes. This is key for 
managing and tracking the uncertainty of alternative modeling variations 


systematically [8|, [13 


1.2. Related Work 

COA. Dash and Druzdzel provide a high-level description of how 
Simon’s COA [l| proceeds to discover the causal dependencies implicit in a 
structure. It returns a ‘partial’ causal mapping: from partitions on the set 
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of equations to same-cardinality partitions on the set of variables—a ‘total’ 
causal mapping would instead map every equation to exactly one variable. 

They show then that any valid total causal mapping produced over a 
structure must be consistent with COA’s partial causal mapping. Nonethe¬ 
less, no observation at all is made regarding COA’s computational properties 
in |3|, leaving in the most up-to-date literature an impression that Simon’s 
COA is the way to go for COP. In this note we show that Simon’s COA 
tries to address an NP-Hard problem in one of its steps, and then clearly 
recommend Nayak’s efficient approach to COP as a £x to COA. 

COP. Inspired by Serrano and Gossard’s work on constraint modeling 
and reasoning [l^, Nayak describes an approach that is provably efficient 
to process the causal ordering: extract any valid total causal mapping and 
then compute the transitive closure of the direct causal dependencies, viz, 
the causal ordering. Nayak’s is a provably correct approach to COP, as all 
valid ‘total’ causal mappings must have the same causal ordering. 

In this note we arrange those insights into a concise yet detailed recipe 
that can be followed straightforwardly to solve COP efficiently. 

2. The Causal Ordering Problem 

We start with some preliminaries on notation and basic concepts to eventu¬ 
ally state COP formally. 

For an equation f{xi, X2, ■■■,Xi) = 0, we will write Vars{f) — {xi, X2, ..., xi] 
to denote the set of variables that appear in it. 

Def. 1. A structure is a pair iS(T, V), where £ is a set of equations over a 
set V of variables, V = U/g£: such that: 

(a) In any subset £' O £ of k > 0 equations of the structure, at least k 
different variables appear, i.e., \£'\ < \V'\; 

(b) In any subset of k > 0 equations in which r variables appear, k <r, if 
the values of any {r — k) variables are chosen arbitrarily, then the values 
of the remaining k variables can be determined uniquely—finding these 
unique values is a matter of solving the equations. 

Note in Def. [Uthat structures are composed of equations, and variables are 
only part of them indirectly as part of equations. Accordingly, set inclusion 
and all set operations such as union, intersection and difference are computed 
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w.r.t. the equations. That is, if iS(£^, V) and S\£', V) are structures, then we 
write S' G S when S' G S. An additional operation for ‘variables elimination’ 
shall be used. We write T := 5 5', to denote a structure T resulting 

from both (i) removing equations S' from S, and (ii) enforcing elimination 
of variables V' = U/g£:' V'^rs{f) from S \ S'. 

Def. 2. Let S{S, V) be a structure. We say that S is self-contained or com¬ 
plete if \S\ = |V|. 


In short, COP will be concerned with systems of equations that are ‘struc¬ 
tural’ (Def. [T]) and ‘complete’ (Def. |2]), viz., that have as many equations as 
variables and no subset of equations has fewer variables than equations0 
Now Def. [3] introduces a data structure that is an intermediate product 
towards addressing COP. We shall state COP formally in the sequel. 


Def. 3. Let S{S,V) be a complete structure. Then a total causal mapping 
over S is a bijection ip ■. S ^ V such that, for all f & S, if (p{f) = x then 
xG Vars{f). 


Note that such total causal mapping tp induces a set of direct causal de¬ 
pendencies (see Eq. [T]), which shall give us the causal dependencies (Def. 0]). 

C^ = { {xa,Xb) I there exists f E S such that (p{f) = Xb and Xa G Vars{f) } (1) 


Def. 4. Let iS(T, V) be a structure with variables Xa,Xb G V, and ip a total 
causal mapping over S inducing set of direct causal dependencies C^p and 
indirectly a transitive closure We say that {xa,Xb) is a direct causal 
dependency in S if (xa,Xb) G C^, and that {xa,Xb) is a causal depen¬ 
dency in S if {Xa,Xb) G C+. 

In other words, {xa,Xb) is in iff Xb direct and causally depends on Xa, 
given the causal asymmetries induced by ip. Then by transitive reasoning on 
Cp we shall be able to infer the transitive closure which is the causal 
ordering. Now we can state COP more formally as Problem [1] 


'^Also, for inferring causal ordering the systems of equations given as input is expected 
to be ‘independent,’ i.e., can only have non-redundant equations. 
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Problem 1. (COP). Given a complete structure S{S, V), find a total causal 
mapping tp over S and derive a set of causal dependencies induced by it. 

In the sequel we shall see two different algorithmic approaches to COP 
(Problem [1]). First, the classical approach informally described by Simon 
in the SOth’s [l|, and reproduced recently in j^; and then Nayak’s one pro¬ 
posed in the 90th’s j^. We shall present the algorithms and analyze their 
corresponding complexities. 

3. Simon’s Causal Ordering Algorithm and its Complexity 

We introduce now additional concepts that are specific to Simon’s COA. 

Def. 5. Let S be a structure. We say that S is minimal if it is complete 
and there is no complete substructure S'G S. 

Example 1. Consider structure S{E^V), where E = { ffixi), f 2 {x 2 ), fsix^), 
ffixi,X 2 ,Xs,Xi,X 5 ), f 5 {xi,X 3 ,X 4 ,X 5 ), fe{x 4 ,XQ), f 7 {x 5 ,X 7 )}. Note thats is 
complete, as |£^| = |V| = 7, but not minimal. There are exactly three minimal 
substructures Si,S 2 ,S 3 C S, whose sets of equations are = {/i(a;i)}, E 2 = 

{f2{x2)}, E3 = {f3{x3)}. □ 

Now Lemma [T] and Proposition [T] are stated to back up a ‘disjointness’ 
assumption that is made by COA upon minimal structures (Def. E]). The 
proof we present here for Proposition [1] is a conveniently derived alternative 
to Simon’s own proof to his original ‘theorem 3.2’ [l|, p. 59]. 

Lemma 1. Let Si{Ei,Vi) and S 2 {E 2 ,V 2 ) be structures. If VinV 2 = 0 then 
iSi n 1 S 2 = 0 (i.e., Eir\E 2 = 0 ). That is, disjointness of variables is strong 
enough to warrant disjointness of equations. 

Proof 1. Let Vi H V 2 = 0. Now by contradiction assume SinS 2 0, then 
there must be at least one shared equation f G Ei,E 2 . Since both Si,S 2 are 
structures, by Def. [2] we know that \Vars{f)\ > 1 and Vars{f) C Vi n V 2 . 
Yet Vi n V 2 = 0. j?. Therefore Vi fl V 2 = 0 implies 81(182 = 0. □ 

Def. 6. Let iSi(£^i, Vi) and 82 ( 82 , 1 ^ 2 ) be structures. Then we say that they 
are disjoint if Vi H V 2 = 0. 
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Proposition 1. Let S be a complete structure. If 81,82 C 8 are any differ¬ 
ent minimal substructures of 8 , then they are disjoint. 


Proof 2. We show the statement by case analysis and then contradiction out 


of Defs. [2H1 and Defs. 0-0 See \Appendix A 


□ 


Simon’s COA is also based on a data structure introduced in Def. [71 


Def. 7. The structure matrix As of a structure 8 {£, V), with fi, /2, /„ G 
£ and xi, X 2 , ...^Xm eV, is a \£\ x |V| matrix of I’s and O’s in which entry 
Oij is non-zero if variable Xj appears in equation ft, and zero otherwise. 

Elementary row operations on the structure matrix may hinder the struc¬ 
ture’s causal ordering and then are not valid in general [H . This also empha¬ 
sizes that the problem of causal ordering is not about solving the system of 
equations of a structure, but identifying its hidden asymmetries. 


3.1. Simon’s Causal Ordering Algorithm 

Simon has described his Causal Ordering Algorithm (COA) only informally 
at a high level of abstraction [l|. It is given a complete structure 8 {£, V) and 
computes a causal mapping ip. The causal ordering itself is to be obtained 
as a post-processing (transitive closure) out of the causal mapping p and its 
induced set of direct causal dependencies. Example [T] (continued) warms 
up for Simon’s algorithm. 

Example 1. (continued). Fia. [Ta\ shows the matrix of the structure 8 given 
above in this example. By eliminating the variables identified with the mini¬ 
mal substructures 81 , 82 , 8 ^ C 8 , a smaller structure T is derived to be input 
at the next recursive step (see Fig. W- This is the main insight of Simon’s 
to arrive at his recursive causal ordering algorithm, as described next. □ 


Algorithm [T] describes the variant of Simon’s original description that 
returns a ‘total’ causal mapping (satishes Def. in])@ We refer to its core 


^This slight variation takes place in lines 7-10 of RTCM in Algorithm[I] and is irrelevant 
to its intractability—which we shall see is due to line 3. Besides, ‘total’ and ‘partial’ causal 
mappings are interchangeable [sj. In particular, recovering the latter from the former is 
straightforward: just merge ‘strongly coupled’ variables in a cluster. Intuitively, these are 
variables whose values can only be determined simultaneously. To be precise, let Xi,X 2 G V 
be variables in a structure S{£, V). We say Xi, X 2 are strongly coupled if S is minimal. 
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xi X2 X3 X4 xq xe xr 

/i 1 0 0 0 0 0 0 

/2 0 1 0 0 0 0 0 

/a 0 0 1 0 0 0 0 

/4 1 1 1 1 1 0 0 

/s 1 0 1 1 1 0 0 

/e 0 0 0 1 0 1 0 

/t 0 0 0 0 1 0 1 
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Xi X 2 X 3 



Xq X 7 


(a) Structure matrix. 


(b) RTCM’s run in 3 steps. (c) Causal graph G^. 


Figure 1: Simon’s RTCM, the core procedure in COA. Fig. [Tal a structure 
matrix given. Fig. [ibl minimal substructures detected in each recursive 
step k are highlighted in shades of gray and have their diagonal elements 
colored. Fig. [T3 Causal graph G^p induced by mapping tp over structure 
S. An edge connects a node Xi towards a node Xj, with Xj,Xj G V, iff Xj 
appears in the equation f & S such that (p{f) = Xj. As the mapping p 
is not unique, accordingly the causal graph G^p is not either—e.g., consider 
p' with /4 I— )■ X 5 and /s 1 —)■ X 4 . The induced graph Gp would have, e.g., a 
connection from X 2 to X 5 instead. Yet their graph transitive closure is the 
same, tc{Gp) = tc{Gp), as we shall see in ^ 


procedure as RTCM (recursive total causal mapping). It comprises the actual 
source of intractability in Simon’s original description, while lines 3-7 of the 
COA procedure were not described by himself but only considered as matter 
of a post-processing. We illustrate RTCM through Example [1] and Fig. [1] 

Example 1. (continued). Let T = S ~ (iSi U 1 S 2 U S 3 ) be the structure 
returned by COA’s first recursive step fc = 0 for this example. Then a valid 
total causal mapping that can be returned at k = 1 (see FiaHU) is COA(T) = 
{(/ 4 ,X 4 ), (/ 5 ,X 5 )}. Since X 4 and X 5 are strongly coupled, COA maps them 
arbitrarily (e.g., it could be 1—)■ X5, /s X4 instead). Such total causal 
mapping p renders a cycle in the directed causal graph G^, induced by p (see 
FzofTcl). which might not be desirable for some applications. □ 

3.2. Hardness of Simon’s Recursive COA 

First of all, we state a decision problem associated with Ending the mini¬ 
mal structures in a given structure (line 3 of Simon’s RTCM procedure in 
Algorithm [1]). For short, we shall refer to this problem as the Complete 
Substructure Decision Problem (CSDP). 
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Algorithm 1 Simon’s Causal Ordering Algorithm based on RTCM. 

1: procedure COA(iS: structure over E and V) 

Require: S given is complete, i.e., \£\ = |V| 

Ensure: Returns (7+, the causal ordering of S 

2: ^ RTCM(5) > gets total causal mapping ip by Simon’s recursive algorithm 

Cip 0 0 initializes set of direct causal dependencies 

4: for all (/, x) G ip do 

5: for all Xa G Vars{f) \ {x} do 

6 : ^ {{xa,x)} 

7: return TQ{Cip) > returns the transitive closure of as described in 114.2I 

1: procedure RTCM(5: structure over ,7 and V) 

Require: Structure S given is complete, i.e., \£\ = |V| 

Ensure: Returns total causal mapping V 

2: (fi 0, S* 0, D ^ 0 initializes 

3: identify all minimal substructures S' 0 S 

4: for all minimal S' 0 S do 

5: 5* ^ 5* U 5' > aggregates into 5* each minimal substructure scanned 

6 : for all f G £' , where S' do 

7: X G- any Xa such that Xa C Vars{f) and Xa ^ D 

8: (f ipU {f, x) > maps to / some variable x G Vars(f) 

9: D ^ Du {cc} > aggregates into D the variables already ‘matched’ 

10: T ^ S G- S* removes £*; eliminates V* = U/ef* where n.b., V* = D 

11: if T 0 then 

12: return ip U RTCM(T) 

13: return ip 


(CSDP). Given a complete structure iS(£^, V) with \£\ = |V| = m and an 
integer !<£</«, does S have a complete substructure S'{£',V') with 

\£'\ = |v'| = a 


In this section we carry out an original study on CSDP and show that 
it is NP-Complete. We consider a basic observation by Nayak apud. 
H , that there is a correspondence between Simon’s structures and bipartite 
graphs. A graph is said bipartite if its vertices can be divided into two 
disioint sets Vi and V 2 and every edge connects a vertex in Vi to one in V 2 
jl5|. Moreover it is said to be i-balanced if |Pi| = IV 2 I = and is said to be 
connected if deg{w) > 1 for all tc G Gi U ¥ 2 - Def. E] introduces the mentioned 
correspondence and provides us some shorthand notation. 
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Xi 

X2 

X3 

Xi 

X5 

xe 

xr 


Figure 2: Bipartite graph G of structure S from Example [H 


Def. 8. Let S(L, V) be a structure, and G = {Vi U V 2 , E) he a bipartite graph 
where Vi £ and V 2 V, and E S so that an edge (/, x) E E if and 
only if we have x G Vars{f). We say that G is the bipartite graph that 
corresponds to structure S, and for short write G ^ S. 

Fig. [2] shows the bipartite graph G ^ S, where S is the initial structure 
given in Example [T] 

Def. [9] introduces a bipartite graph property of our interest, and then 
Lemma [ 2 ] originally establishes an equivalence of two problems: searching for 
complete substructures S' C S and searching for specihc bipartite subgraphs 
G' C G. 

Def. 9. Let G(EiU V 2 , E) be a bipartite graph. We say that G is structural 
if, for every ¥( C Vi, there is a connected bipartite subgraph G'{V( U V^^E') 
with iFj'l < m'l. (Note resemblance with Def. [2p. 


Lemma 2. Let S{V,£) be a complete structure with \£\ = |V| = m and 
1 < i < m provide an instance of CSDP. Let also G(Vi U V 2 , E) be a bipartite 
graph G ^ S. Then S has a substructure S' that gives a yes answer to CSDP 
if and only if G has a bipartite subgraph G'(y( U V 2 , E') such that G' ~ S' 
and all of these conditions hold: 

(i) Bipartite subgraph G' is structural; 
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Xi X2 X3 X4 

/i 1 0 1 0 

/2 1 1 0 0 

/s 0 1 1 0 

/4 1 1 1 [T] 



Xi 

X2 

X3 

X4 



(a) COA (2recursive steps), (b) Bipartite graph G. (c) Bipartite complement 

Figure 3: Another example of structure S with its correspondent bipartite 
graph G ~ iS. 


(ii) For every f G there is an edge {f,x) G E only if x E 
(in) Bipartite subgraph G' is i-balanced, that is, \Vl\ = \V)\ = 


Proof 3. We establish conditions (i-iii) as the bipartite subgraph properties 


that correspond to a yes answer to CSDP. See Appendix B. □ 


We now reach the key property in our argument to show COA’s hardness. 
A bicligue (or complete bipartite graph) is a bipartite graph G = {V 1 GV 2 , E) 
such that for every two vertices n G W, n G 1/2, we have (u, v) E E Thus 
the number of edges in a biclique is \E\ = |Fi| ■ IV 2 I. A biclique with partitions 
of size |Vi| = m and IV 2 I = n is denoted Km^n- For instance, the bipartite 
graph G shown in Fig. [2] has a K 2^2 biclique, viz., G'(V( U Vf,E'), where 
y( = {hJb}, Vi = {x 4 ,X 5 } and E’ = {{f4,X4), ihws)}- Let 

us now consider Example El 


Example 2. We introduce another structure S, whose structure matrix is 
shown in Fig. \3^ together with the bipartite graph G S in Fig. Let 
us consider subgraph G'{V( U Vi,E') in G that has V( = {/i,/ 2 ,/ 3 } and 
V) = {xi,X 2 ,X 3 }. Observe that we have G' ~ S', where S' G S is the 
complete substructure represented by the shaded 3x3 matrix in Fig. 

Note also that such bipartite subgraph G' satisfies the conditions (i-iii) of 
Lemma m and in fact S' is a complete substructure in S. Clearly, G' ~ S' is 
not a bicligue, as it is not the case that deg{w) = 3 for all w E V(U V). So 
there is no obvious connection between identifying complete substructures in 
a structure and bicligues in a bipartite graph. □ 
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The key insight to COA’s hardness conies as follows—consider Example [2] 
and Fig. Elfor illustration. Recall from Lemma |2]^ii) that, if we had an edge, 
say, connecting {fi,X 4 ) G E, then by Def. [T]the substructure S'{S',V') with 

= {/i,/ 2 ,/ 3 } would have V = [jf(z£:Vars{f) = {xi,X2,X3,X4} instead. 
That is, it would no more be a complete substructure. In fact, verifying such a 
negative property (Lemma |2Jii) in structural bipartite graphs translates onto 
a neat positive property (biclique) in the bipartite complement of bipartite 
graph G. 

The bipartite complement of a bipartite graph G(Vi U V 2 , E) is a bipartite 
graph G'^(ViUI 4 , where an edge (m, v) G E^^ iff (u, v) ^ E for every m G Vi 
and n G 14 . Given a bipartite graph G(ViUV 2 , E), it is easy to see that we can 
render G^iVi U V 2 , E^^) in polynomial time—consider, e.g., the biadjacency 
matrix of G (viz., the structure matrix in Fig. I3a| . and run a full scan on it 
to switch the boolean value of each entry in time 0(|Vi| • IV 2 I). Moreover, 
this operation is clearly invertible, as there is a one-to-one correspondence 
between G and G‘^. 

Fig. 133 shows the bipartite complement graph G^ of the bipartite graph 
G from Fig. |3bl Note that has a biclique K 31 with its vertices shaded 
in dark grey. To emphasize the point, if we had an edge (/i,X 4 ) G E (see 
Fig. [3b|) . then such a biclique would not exist in G'^ (see Fig. |3c])- We 
would have a iF 2,1 biclique instead with all edges in {/ 2 , /s} x {X 4 }, but note 
that 2 -|- 1 = 3 does not sum up to |Vi| = IV 2 I = m = 4. 

We can now establish the result we seek. We introduce below the Exact 
Node Cardinality Decision Problem (ENCD), which is a variant of biclique 
problem in bipartite graphs that is known to be NP-Complete (l^. p. 393]. 
Theorem [1] establishes its connection with CSDP. 


(ENCD). Given a bipartite graph G = (Vi U V 2 ; E) and two positive 
integers a, b, does G have a biclique Ka^ 


Theorem 1. CSDP is NP-Complete. 


Proof 4. We shall construct an instance of ENCD and describe its poly¬ 
nomial-time reduction to an instance of CSDP. We refer to Def. and 
Lemma [1 and present the argument in detail in Appendix C\ □ 
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Finally, we formulate an optimization problem associated with CSDP. We 
refer to it as the Minimal Substructures Problem (MSP). Corollary [T] then 
hnally establishes the hardness of Simon’s COA based on RTCM. 


(MSP). Given a complete structure iS(£^,V) with \£\ = |V| = m, list all 
its complete substructures S\£', V) with \£'\ = \V'\= I where 1 < I <m 

is minimal. 


Corollary 1. Let S be a complete structure. The extraction of its causal 
ordering by Simon’s COA{S) through its RTCM procedure reguires solving 
MSP, which is NP-Hard. 

Proof 5. Clearly, MSP is the optimization problem that needs to be solved 
at each recursive step k of Simon’s RTCM procedure — AlgorithmUi line 3, 
“find all minimal substructures S' C S. ” But MSP is clearly an optimization 
problem that subsumes CSDP, which we know from TheoremUl that is NP- 
Complete by a reduction from ENCD. 

In fact, an instance of ENCD' (as an optimization variant of ENCD) that 
can be reduced to MSP is as follows: given a bipartite graph G(Vi U V 2 , E) 
that bears the complement structural property (cf. Theorem{^ and has |Vi| = 
IV 2 I = nr, list all bicligues contained in G where 1 < I < m is 

minimal. In worst-case scenario, it reguires searching for all bicligues Kg^rn-i 
for each of the m — 1 possible values of 1. 

ENCD is NP-Complete, therefore ENCD' is NP-Hard. Accordingly, CSDP 
is NP-Complete (cf. Theorem{^ therefore MSP is NP-Hard. □ 

COP (Problem [1]), nonetheless can be solved efficiently by means of a 
different approach due to Nayak j^, which we describe in next section. 

4. Nayak’s Efficient Algorithm to COP 

The first part of COP requires finding a total causal mapping ip\ £ over 
a given complete structure S. While Simon’s RTCM goes into an intractable 
step, inspired by Serrano and Gossard’s work Q on constraint modeling 
and reasoning Nayak has found a polynomial-time approach to that task. 
We cover it next in all of its steps and see their complexity in some detail. 
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4-1. Total Causal Mappings 

Given a structure S, there may be more than one total causal mappings 
over S (recall Example [1]). So a question that arises is whether the transitive 
closure is the same for any total causal mapping ip over S; that is, whether 
the causal ordering of S is unique. Proposition |21 from Nayak j^, ensures 
that is the case. 

Before proceeding, we introduce Def. [10] in order to detach the notion of 
‘strongly coupled’ variables from ‘minimal structures’ (Def. |5|) and connect 
it to the concept ‘causal dependency’ (Def. Sj). 

Def. 10. Let S{£, V) be a structure with variables Xa, x^ G V, and (7+ be the 
set of causal dependencies induced by total causal mapping (p over S. We say 
that Xa and Xb are strongly coupled if we have both {xa,Xb), {xb,Xa) G G+. 

Recall from Example [1] the strongly coupled variables, x^ and x^. Now 
we can see it only in terms of causal dependencies. 


Proposition 2. Let iS(£^, V) be a structure, and tpi'. £ and (p 2 '. £ 
be any two total causal mappings over S. Then Ct = . 


Proof 6. The proof is based on an argument from Nayak which we 
present in a bit more of detail (see \Appendix D]) . Intuitively, it shows that 
if ipi and ip 2 differ in the variable an eguation f is mapped to, then such 
variables, viz., (pi{f) = Xa and (p 2 {f) = Xb, must be causally dependent on 
each other (strongly coupled). □ 


Another issue is concerned with the precise conditions under which total 
causal mappings exist (i.e., whether or not all variables in the equations can 
be causally determined). In fact, by Proposition jS] based on Nayak jil apud. 
Hall [l^, p. 135-7], we know that the existence condition holds if and only 
if the given structure is complete. We refer to Even [l^ to briefly introduce 
the additional graph-theoretic concepts that are necessary here: 


• A matching in a graph is a subset of edges such that no two edges in 
the matching share a common node. 

• A matching is said maximum if no edge can be added to the matching 
(without hindering the matching property). 
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• Finally, a matching in a graph is said ‘perfect’ if every vertex is an end¬ 
point of some edge in the matching — in a bipartite graph, a perfect 
matching is said to be a complete matching. 

Proposition 3. Let S{S,V) be a structure. Then a total causal mapping 
ip\ £ over S exists if and only if S is complete. 

Proof 7. We observe that a total causal mapping ip ■. £ ^ V over S cor¬ 
responds exactly to a complete matching M in a bipartite graph B = {ViU 
V 2 ,E), where V, and E S. In fact, by Even apud. Hall’s 

theorem (cf. \la. 135-7]), we know that B has a complete matching iff (a) 
for every subset of vertices E C Vi, we have |F| < \E{E)\, where E{E) is 
the set of all vertices connected to the vertices in E by edges in E; and (b) 
|Pi| = |l/2|. ByDefm (no subset of equations has fewer variables than equa¬ 
tions), and Def. IE (number of equations is the same as number of variables), 
it is easy to see that conditions (a) and (b) above hold iff S is a complete 
structure. □ 

The problem of hnding a maximum matching is a well-studied algorithmic 
problem. The Hopcroft-Karp algorithm is a classical solution [l^, bounded 
in polynomial time by 0{y\Vi\ -|- IV 2 I |-E|). It solves maximum matching in 
a bipartite graph efficiently as a problem of maximum flow in a network (cf. 
B p. 135-7], or B) P- 763]). That is, we can handle the problem of hnding 
a total causal mapping p over a structure S (see Alg. [2]) by hrst translating 
it to the problem of maximum matching in a bipartite graph in time 0(|iS|). 
Then we can just apply the Hopcroft-Karp algorithm to get the matching 
and hnally translate it back to the total causal mapping p. This procedure 
has been suggested by Nayak in connection with his Proposition [3] and its 
respective proof [^. 

Fig. m shows the complete matching found by the Hopcroft-Karp algo¬ 
rithm for the structure given in Example [H 
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Figure 4: Complete matching M for structure S from Example [T] 


Algorithm 2 Find a total causal mapping for a given structure. 

1: procedure TCM(5 : structure over E and V) 

Require: S given is a complete structure, i.e., \£\ = |V| 

Ensure: Returns a total causal mapping (p 
2 : B{ViUV2,E)^0 

3 : (p 0 

4: for all {f,X) € S do > translates structure 5 to a bipartite graph B 

5: El ^ El U {/} 

6: for all x G X do 

7: Ea ^ E2 U {x} 

8: E^EU{{f,x)} 

9: Mi— Hopcroft-Karp(R) > solves the maximum matching problem 

10: for all (/, x) G M do > translates the matching to a total causal mapping 

11 : p ^ pU{{f,x)} 

12: return p 


Corollary [2] and Remark [T] summarize the results presented in this note. 

Corollary 2. Let S{£^ V) he a complete structure. Then a total causal map¬ 
ping ip-. 8 over S can be found by (Alg. TCM in time that is bounded 
hyO{^\-\S\). 

Proof 8. Let B = {Vi U V 2 ,E) be the bipartite graph corresponding to com¬ 
plete structure S given to TCM, where Vi ^ £, V 2 ^ V, and E S. The 
















translation of S into B is done by a scan over it. This scan is of length 
|iS| = \E\. Note that number \E\ of edges rendered is precisely the length 
|iS| of structure, where the denser the structure, the greater |iS| is. The re¬ 
translation of the matching computed by internal procedure Hopcroft-Karp, in 
turn, is done at expense of \£\ = |V| < |iS|. Thus, it is easy to see that TCM 
is dominated by the maximum matching algorithm Hopcroft-Karp, which is 
known to be 0{-^\Vi\ + IV2I ■ \E\), i.e., 0{-^\8\ + |V| ■ |iS|). Since S is as¬ 
sumed complete, we have |£^| = |V| then \/\V\ + |V| = \/2 Therefore, 

TCM must have running time at most 0{^J\V\ ■ 151). □ 

f.2. Computing Transitive Closure 

Finally, recall that the set C^ of direct causal dependencies induced by a total 
causal mapping (p over a given structure 5(5, V) produces to the so-called 
‘causal graph,’ i.e., a directed graph (digraph) G(y,E) where V ^ V and 
E C^. So, computing set of causal dependencies given C^ corresponds 
to computing transitive closure (reachability links) on G. Note that \V\ = 
|V|, and also note that \E\ = \G^\ = |5| —|V| = 0(|5|). 

Classical algorithms for such task (e.g., Floyd-Warshall’s) are bounded in 
time 0(|V|^) (l9|, p. 697]. Another way to do it is by discovering reachability 
links using either one of the popular graph traversal algorithms, breadth-hrst 
search or depth-hrst search (DFS) |l9|, p. 603]. Algorithm [3] describes DFS- 
based transitive closure over digraph G{V,E). It runs in time 0(|ld| ■ |h^|), 
which means 0(|V| ■ |5|) for a complete structure 5(5, V). 

Algorithm 3 DFS-based transitive closure. 

1: procedure TC( G{V,E): digraph) > where G is such that F 1 —>■ V and E 1 —>■ G,p 
2: E+ ^ 0 

3: for all w G F do > for all vertices v in digraph G 

4: It ^ 0 > initializes D 

5: DFS(G, v, D) > discovers into D all u, where v is reachable from u 

6 : D D\ {v} > enforces an irreflexive transitive closure 

7: E+ ^[j^^j,{iu,v)}UE+ 

8: return G+(F,F;+) 

9: procedure DFS(G: digraph, v: vertex, It: global set of discovered vertices) 

10: D ^ DU {u} > labels v as discovered 

11: for all u where {u,v) G G do 

12: ii u ^ D then > vertex u is not yet labeled as discovered 

13: DFS(G,u,It) 

14: return 
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Remark 1. Let S{S,V) be a complete structure. Then we know (cf. Propo¬ 
sition m that a total causal mapping over S exists. Let it be defined (p = 
TCM{S), which can be done in |iS|). Then the causal ordering im¬ 

plicit in S can be correctly extracted (cf. Proposition\^ by computing the 
set of causal dependencies induced by ip, in terms of graph transitive closure 
(TC). The latter is bounded in time by 0{\V\ ■ \S\), that is, the complexity of 
COP is dominated by TC. 

In other words, the complete recipe to solve COP consists in replacing 
Simon’s RTCM by Nayak’s TCM in COA (Algorithm{If). Transitive closure 
(TC) in turn is computed as described in Algorithm\^ □ 

5. Conclusions 

Causal ordering inference is a classical problem in the AI literature, and 
still relevant in light of modern applications Q, e.g., large-scale hypothesis 
management and analytics j^. In this note we have shown that Simon’s 
classical algorithm (COA) tries to address an NP-Hard problem; and then 
we have given a detailed account on the state-of-the-art algorithms for the 
causal ordering problem (COP, stated as Problem d]). The key points are: 

• By Theorem [1] and Corollary [H we know (an original hardness result) 
that Simon’s approach to COP requires solving an NP-Hard problem; 

• From the seminal work of Simon Q (cf. §2]) and Nayak (cf. ^ and 
Propositions |2] and |3]), an approach is conveyed to solve COP efficiently; 

• By Corollary [2l we know how to process a complete structure into a 
total causal mapping in time that is bounded by 0(y^|v[- |iS|). This is a 
core step to solve COP, which Simon’s COA in turn makes intractable. 

• By Remark dl we know how to extract the causal ordering of a complete 
structure in time 0(|V| ■ |iS|), that is, in sub-quadratic time on the 
structure density (number of variable appearances). The machinery of 
causal ordering is then suitable for processing very large structures. 
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Appendix A. Proof of Proposition [1] 

Let S be a complete structure. If 81,82 C 8 are any different minimal 
substructures of 8 , then they are disjoint. 

Proof 9. We show the statement by case analysis and then contradiction 
out of Defs. [2H3 By assumption both 81,82 are minimal (hence complete). 
Let their size be |Vi| = \£i\ = I and IV 2 I = \£ 2 \ = m. Let also i < m. 
The argument is analogous otherwise but it shall be convenient to keep a 
placeholder for the size of the smaller substructure (with no loss of generality). 

By Def. 0 (minimal structures), we know that iSi ^ 82 and 81 ^ 82 . Now 
suppose 81,82 are not disjoint. Then by Def. 0 there must be at least one 
shared variable x G Vi, V 2 , and thus we must have IV 1 UV 2 I < i + m — 1. 

We can then proceed through case analysis by inguiring how many egua- 
tions are shared by 81 , 82 . Since 81 is minimal with \£i\ = |Vi| = I for 
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1 < i < m, the number of equations that are shared with S 2 could be any 
0 < k < i. (Note that the case of k = i shared equations would lead to the 
more obvious contradiction that Si C S 2 , even though S 2 is minimal). 

Let us start with the case £i H S 2 = 0 to illustrate the rationale in its 
simplest form. In this case, no equations are shared yet at least one variable 
is. Then we have \£i U £^ 2 ! = I + m, but |Vi U V 2 I < i + m — 1 . Since we 
have both iSi,iS 2 C S, in fact we have their sets of equations £\,£2 0 £ as 
well and then £iU £2 C £. Now, by Def. [ 2 ] (valid structure), we know that in 
any subset of k > 0 equations of S, at least k different variables must appear. 
But rather we have |£^i U £^ 2 ! = 1 + m and yet |Vi U V 2 I < I + m — 1. That is, 
we reach a contradiction to Def. [21 viz., \£i U £^ 2 ! > |Vi U V 2 I. 

The next case is when one equation is shared (\£i fl £^ 2 ! = !)■ Note that, if 
we had \£i\ = |Vi| = £ = 1 in particular then the only equation / G £^i would 
have \Vars{f)\ = 1 and be shared with £ 2 , making iSi C S 2 even though S 2 
is assumed minimal. ' 7 . We rather know that \£i\ = i > 2. Also, note that 
we must have \Vars{f)\ > 2 for all f & £ 1 , otherwise there would be some 
g E £i with \Vars{g)\ = 1 even though \£i\ > 2. That is, we would have a 
minimal substructure within iSi, although it is minimal. 

So, since one equation is shared and for all f E £i we have \ Vars{f)\ > 2, 
then at least two variables must be shared. Observe then that \£i U £^ 2 ! = 
£ + m — 1 (since exactly one equation is shared) and IV 1 UV 2 I <£ + m — 2 
(at least two variables are shared). Again, we see the same contradiction in 
face of Def. [21 viz., \£i U £”21 > |Vi U V 2 I. f. 

Now we complete the case analysis by making the argument abstract for 
any number of shared equations, 0 < k < i (an inductive step, n.b., is 
not required because k ^ N is bounded. Note that, for any such number 
0 < k < i, we must have at least k + 1 shared variables, otherwise the shared 
substructure having k equations, formed out of £\ ^£ 2 , would be minimal as 
well even though £^in £^2 ^ £\i £2 (that is, rendering both iSi,iS 2 non-minimal. 
^). However, once more we see that this contradicts Def. [21 □ 

Appendix B. Proof of Lemma [2] 

Let iS(V,£^) be a complete structure with \£\ = |V| = m and 1 < £ < m 
provide an instance of CSDP. Let also GiVi U V 2 , E) be a bipartite graph 
G ^ S. Then S has a substructure S' that gives a yes answer to CSDP if 
and only if G has a bipartite subgraph G'{V{ U Vf E') such that G' ~ S' and 
all of these conditions hold: 
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(i) Bipartite subgraph G' is structural; 

(a) For every f G V-[, there is an edge {f,x) G E only if x & V^; 

(Hi) Bipartite subgraph G' is ^balanced, that is, \V-l\ = \V)\ = 

Proof 10. First, we consider the ‘if’ statement—that is, all conditions (i- 
iii) together are sufficient. Let G' G G be a bipartite subgraph G'{V( U Vf, E') 
that satisfies all conditions (i-iii), and S\£',V') be a substructure of S with 
G' S'. We shall see that such S' does give a yes answer to CSDP, that is, 
it is a complete substructure with \£'\ = |V'| = 1. 

From condition (i) we know that G' is structural (Def. That is, for 
every V(' C V{, there is a connected bipartite subgraph G"{V(' U V 2 ,E") 
with \V{'\ < 1 ^ 2 '!. Since V) 1 —)■ £', Vf 1 —)■ V' and E' 1 —)■ S', such property 
bears obvious resemblance with Def. [I] (structure). That is, the ‘connected’ 
bipartite subgraph aspect implies that, for any subset of \£"\ eguations in £', 
at least \V"\ > \£"\ variables appear and each such variable x G V" appears 
in some f G £", otherwise x G V)' would be disconnected in G"{¥(' WV)', E"). 

Condition (ii) ensures in addition that U/ef' ^o,^s{f) = V'. That is, the 
variables in V' are exhaustive w.r.t. £'. Thus, structure S' satisfies Def. [3 
Finally, condition (Hi) ensures that S' is complete with \£'\ = |V'| = 

We consider now the ‘only if’ statement — i.e., every condition (i-iii) is 
necessary. We assume a bipartite graph G' ~ S' and show that lacking any 
one such condition implies that S' cannot be complete or cannot be a structure 
at all. First, it is easy to see that when condition (Hi) does not hold for G' 
then a structure S' with G' ~ S' cannot be complete. 

Now suppose condition (ii) does not for G'. That is, there is some f G V{ 
that has incidence with some x G V 2 \V). Thus we have V[ £' and Vf V' 
but U/ef' y0,'fsi.f) 7^ y■ Therefore either S' does not satisfy Def. [3 or we 
cannot actually have G' S'. '7. 

Finally, consider that G' is not structural (Def. That is, there is some 
V(' C V( such that no connected bipartite subgraph G"{V{' UV)', E") exists in 
G' with \V{'\ < 1 ^ 2 '!. Considering G' ~ S', that would imply for S'{£',V') 
either an eguation f E £' with no variables (a disconnected vertex x G V(), or 
a redundant subset of eguations—number of eguations is larger than number 
of variables appearing in it. Either conditions violate Def. [3 so S' cannot be 
a structure even though G' ^ S'. '7. □ 
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Appendix C. Proof of Theorem [T] 

CSDP is NP-Complete. 

Proof 11. We shall construct an instance of ENCD and describe its poly¬ 
nomial-time reduction to an instance of CSDP by using Lemma d. 

Constructing an instance of ENCD. Let G(ViUV 2 , E) be a bipartite 
graph such that, for every C Vi, there is a bipartite subgraph G'{y{UVf E') 
with \Vl\ < m'l and deg{f) < mi for all f G V{. Note that this is the 
complement property of the structural bipartite graph property (see Def. 

It is meant to ensure that the bipartite complement graph G^iVi U V 2 , E^) of 
G is structural—satisfies Def. 0 That is, when we produce G^, we know that 
it can possibly correspond to a structure S such that G^ ^ S. Let also G have 
1^11 = 1 ^ 2 ! = m in order to ensure that such structure S will be complete as 
well—recall that S given in CSDP is assumed complete indeed. 

Now let G and an integer 1 < i < m provide an instance of ENCD for 
integers a = i and b = m — i. That is, our problem is to decide whether G has 
a bicligue K^^rn-i- Imposing both of the above properties on G, n.b., incurs in 
no loss of generality w.r.t. ENCD as it does not open a pruning opportunity 
in searching for a bicligue Ke^m-e in powerset ViVi x V 2 )- Such a bicligue 
if existing in G, can be denoted G{yl U Vf,K), where \Vl\ = I and 
\Vf\ = m — i, and K is a complete set of edges between V( and Vf. Note 
also that V( C Vi and Vf <ZV 2 . 

Production of an instance of CSDP from the ENCD one. Let 

G^{yiUV 2 , E^) be the bipartite complement graph ofG, where an edge (/, x) G 
E'^ if and only if (/, x) ^ E for f E Vi and x EV 2 . Clearly, bipartite graph G^ 
can be produced in polynomial time from G—as mentioned in ^3.^ consider 
the ‘structure matrix’ (biadjacency matrix) of G and run a full scan on it to 
switch the boolean value of each entry in time 0(|Pi| • IV 2 I) and then get G^’. 
Decision problem correspondence. Now we show that a bicligue 
in G, if existing, corresponds to a bipartite subgraph G^ (y{ U Vf E^”) in 
that satisfies the conditions (i-iii) of Lemma\B That is, we show that a yes 
answer to ENCD implies a yes answer to CSDP. 

In fact, as G^’ is the bipartite complement graph of G, then the bicligue 
G{y{ U Vf , K) in G becomes a bipartite subgraph G'’{V( U Vf , 0) in G^. Now 
let G'^fV) U Vf E^') be such that Vf = 1^2 \ Vf. We observe that: 

(i) The presence of bicligue G{V( U Vf,K) in G indicates that Vf could 
not have contributed to satisfy the complement structural property for 
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VI, only V 2 = V 2 \ V" 2 * could. But such property turns into the structural 
property in G^, thus G^'(y( U must be structural indeed. That 

is, condition (i) of Lemma\^ is ensured. 

(a) By the fact that we have G'^iyi U Vf, 0) in G^ we know that, for all 
f G Vl, there can only be an edge (/, x) G if x G Vf indeed. That 
is, condition (ii) of Lemma\^ is ensured. 

(Hi) The presence of biclique GiVl U Vf,K) of form Ki^rn-e G implies 
that V) has size \Vl\ = Besides, V{ will have size \V)\ = IV 2 I — \Vf\ = 
m — {m — t) = i. That is, we must have \Vl\ = 11^21 = d and then 
condition (Hi) of Lemma\^is ensured as well. 

We have then established that the existence of a biclique G 0 G of form 
implies the existence of a bipartite subgraph G^' C G^, where G^' 
satisfies the conditions (i-iii) of Lemma That is, we get a yes answer to 
CSDP if we find one to ENCD. It remains to show the ‘only if’ part of the 
correspondence. 

In fact, suppose no biclique GiyiUVf, K) of form Ki^rn-i exists in G(ViU 
V 2 ,E). Clearly, it means that for any V( C Vi where \Vl\ = I, there is at 
least one f G V( such that an edge (/, x) with x E Vf is missing from E. 
Accordingly, in U V 2 , we cannot have G^' C G'’ with condition (ii) 

of Lemma [H satisfied. 

ENCD is NP-Complete. Thus CSDP must be NP-Complete as well. □ 

Appendix D. Proof of Proposition [2] 

Let S(S, V) be a structure, and (pi: E —)■ V and ip 2 '. E —)■ V be any two total 
causal mappings over S. Then Cf = Cf. 

Proof 12. The proof is based on an argument from Nayak which we 
reproduce here in a bit more of detail. Intuitively, it shows that if tpi and 
p >2 differ in the variable an equation f is mapped to, then such variables, 
viz., (pi{f) and (p 2 {f), must be causally dependent on each other (strongly 
coupled). 

To show Cf = Cf reduces to show both Cf C Cf and Cf C Cf. We 
show the first containment, and the second is understood as following by 
symmetry. Closure operators are extensive, X C cl{X), and idempotent. 
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c/(c/(X)) = cl{X). That is, if we have Ci C Cf, then we shall have Cf C 
{Cf)^ and, by idempotence, Cf C Cf. 

Then it suffices to show that Ci C Cf, i.e., for any [x', x) G Ci, we 
must show that {x', x) G Cif as well. Observe by Def. 0 that both ipi and ip 2 
are bijections, then, invertible functions. If(pf^{x) = ipf^{x), then we have 
{x', x) G C 2 and thus, trivially, {x', x) G Cf. Else, ipi and ip 2 disagree in 
which equations they map onto x. But we show next, in any case, that we 
shall have {x', x) G Cf. 

Take all equations g E £' S such that (pi{g) 7 ^ ^ 2 ( 9 ), and let n < 
\£\ be the number of such ‘disagreed’ equations. Now, let f E £' be such 
that its mapped variable is x = (pi{f). Construct a sequence of length 2n 
such that, sq = (pi{f) = x and, for I < i < 2n, element s* is defined 
Si = T 2 {Ti^( si-i))■ That is, we are defining the sequence such that, for each 
equation g G £', its disagreed mappings (pi{g) = Xa and (p 2 {g) = Xb are such 
that (pi^g) is immediately followed by (p 2 {g)- A.s Xa, Xb G Vars{g), we have 
{xa, Xb) G C 2 and, symmetrically, {xb, Xa) G Ci. The sequence is of form 
S = {X, Xf, . . . ,Xa, Xb, . . . , X2n-1, X2n) ■ 
f 9 h 

Since x must be in the codomain of ip 2 , we must have a repetition of x 
at some point 2 < k <2n in the sequence index, with Sk = x and Sk-i = x" 
such that [x”, x) E C 2 . If x" = x', then [x', x) E C 2 and obviously [x', x) E 
Cf. Else, note that xj must also be in the codomain of ipi, while x" in the 
codomain 0 /(^ 2 - Let i be the point in the sequence, 3 < i < 2n — l, at 
which Si = Xf = Xa and se+i = Xb for some Xb such that {xf, Xb) E C 2 . It 
is easy to see that, either we have Xb = x” or Xb 7 ^ x” but {xb, x”) E C^. 
Thus, by transitivity on such a causal chain, we must have {xf, x") G Cf 
and eventually {xf, x) E Einally, since x' E Vars{f) and </? 2 (/) = Xf, 
we have {x', Xf) E C 2 and, by transitivity, {x', x) G Cf. □ 
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